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Interictal FDG-PET (iPET) is a core tool for localizing the epileptogenic focus, potentially 
before structural MRI, that does not require rare and transient epileptiform discharges 
or seizures on EEG. The visual interpretation of iPET is challenging and requires years of 
epilepsy-specific expertise. We have developed an automated computer-aided diagnos- 
tic (CAD) tool that has the potential to work both independent of and synergistically with 
expert analysis. Our tool operates on distributed metabolic changes across the whole brain 
measured by iPETto both diagnose and lateralize temporal lobe epilepsy (TLE). When diag- 
nosing left TLE (LTLE) or right TLE (RTLE) vs. non-epileptic seizures (NES), our accuracy 
in reproducing the results of the gold standard long term video-EEG monitoring was 82% 
[95% confidence interval (CI) 69-90%] or 88% (95% CI 76-94%), respectively. The clas- 
sifier that both diagnosed and lateralized the disease had overall accuracy of 76% (95% 
CI 66-84%), where 89% (95% CI 77-96%) of patients correctly identified with epilepsy 
were correctly lateralized. When identifying LTLE, our CAD tool utilized metabolic changes 
across the entire brain. By contrast, only temporal regions and the right frontal lobe cortex, 
were needed to identify RTLE accurately, a finding consistent with clinical observations 
and indicative of a potential pathophysiological difference between RTLE and LTLE. The 
goal of CADs is to complement - not replace - expert analysis. In our dataset, the accu- 
racy of manual analysis (MA) of iPET (~80%) was similar to CAD. The square correlation 
between our CAD tool and MA, however, was only 30%, indicating that our CAD tool does 
not recreate MA. The addition of clinical information to our CAD, however, did not substan- 
tively change performance. These results suggest that automated analysis might provide 
clinically valuable information to focus treatment more effectively. 
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INTRODUCTION 

It is difficult to differentiate between patients with epilepsy (PWE), 
and those with non-epileptic seizures (NES). The clinical assess- 
ment relies on the report of untrained witnesses or the patients 
themselves. A non-epileptic seizure is defined as the presence of 
external seizure symptoms and/or signs with no electrographic 
features characteristic of epilepsy. Long term video-EEG moni- 
toring has shown consistently that roughly one third of patients 
diagnosed with "medication refractory epilepsy" in fact suffer 
from NES (Kerr et al., 2012a). Because they don't suffer from 
epilepsy, these patients with NES (PWN) are not treated effec- 
tively with anti-epileptic drugs (AEDs). For the majority of PWN, 
the NES are a manifestation of dissociative or conversion disor- 
der in which their psychological challenges manifest themselves 



physically (Marchetti et al., 2008, 2009). A minority of PWN suf- 
fers from organic, non-epileptic maladies that can be confused 
with seizure disorder including, but not limited to, dementia 
and cardiovascular disease (Sahaya et al., 2011). The gold stan- 
dard for the differential diagnosis and pre-surgical assessment 
of epilepsy includes 72 or more hours of video-EEG monitor- 
ing (Cragar et al., 2002; LaFrance and Devinsky, 2004). How- 
ever, 10% of patients admitted for this extensive assessment 
leave with inconclusive results (Kerr et al., 2012a). Considering 
that one sixth of PWE are diagnosed with medication refractory 
epilepsy (Privitera, 2011), improved methods to effectively iden- 
tify PWN who do not benefit from AEDs effectively could reduce 
the morbidity and both the financial and social cost of treating 
epilepsy. 
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Improved diagnostic tools could also help PWE. The diffi- 
culty in ruling out non-epileptic etiologies speaks to the chal- 
lenge of adequately localizing and characterizing each patient's 
epileptic etiology. The major seizure type discriminations are 
focal vs. generalized; partial vs. complex; and lesional vs. non- 
lesional. Each of these key discriminations leads patients down a 
different treatment path. When medication or other novel treat- 
ments like the vagus nerve stimulator fails, as they frequently 
do, the patient is left to consider resective neurosurgery. Recent 
reports have shown that surgery is most effective earlier in the 
course of disease (Engel et al, 2012). Improved diagnostic tools 
could more quickly and effectively diagnose patients with epileptic 
seizures and therefore speed the progression toward considering 
the surgical option. 

Ultimately, our goal is to establish a general, automated 
computer-aided diagnostic (CAD) tool that effectively combines 
clinical information, manual interpretation of EEC and imaging 
technologies as well as automated analysis of interictal FDG-PET 
(iPET), EEG, structural MRI (sMRI), and diffusion MRI for all 
subtypes of epilepsy and NES. To accomplish this, we first must 
develop effective CAD tools that harness the information from 
each modality for a limited set of epileptic localizations. We have 
begun already to address automated analysis of interictal EEG for 
a wide variety of epilepsy subtypes (Kerr etal, 2012a). Others have 
described effective CAD tools that diagnose and lateralize tempo- 
ral lobe epilepsy (TLE) using structural and diffusion MRI (Farid 
et al, 2012; Focke et al, 2012; Keihaninejad et al, 2012). 

The clinical, metabolic, and structural differences between left 
and right TLE can be subtle. Some theories suggest that TLE is 
inherently a bilateral disease. Potentially, due to the strong func- 
tional link between the hippocampi, the only clinical difference 
is that in the aura of patient with left TLE (LTLE) more fre- 
quently includes language dysfunction. Over time, patients with 
LTLE more commonly develop verbal memory deficits, compared 
to non-verbal memory deficits in right TLE (RTLE) (Delaney et al., 
1980; Kim et al, 2003). This functional connection between the 
hippocampi may also lead some patients to be falsely-lateralized 
using scalp EEG because a small seizure onset zone (SOZ) in one 
hippocampus can induce larger scale ictal activity in the con- 
tralateral hippocampus with very little time delay. This can lead 
neurologists to falsely conclude that the SOZ is either bilateral or in 
the contralateral hippocampus. Structural and metabolic imaging 
can reduce these errors by demonstrating that that one temporal 
lobe is asymmetrically affected, as shown by the previous CAD 
tools that lateralize TLE (Farid et al, 2012; Focke et al, 2012; 
Keihaninejad et al, 2012). Studies of the functional connectivity 
of these epileptic networks, however, conclude that there are very 
few, if any, differences between the two lateralizations (Zhang et al, 
2010; Liao et al, 201 1; Morgan et al, 201 1, 2012; Pittau et al, 2012; 
McCormick et al., 2013). Recently, Pereira et al. (2010) suggested 
that more patterns of functional connectivity change in LTLE 
compared to RTLE. However, after patients suffer from intractable 
seizures for 10 or more years, the intrahemispheric hippocampal 
connectivity linearly increases with the duration of disease, sug- 
gesting that over time lateralized disease may become bilateral 
disease (Morgan et al, 2011). Because patients with bilateral hip- 
pocampal disease are no longer considered surgical candidates, 



improved methods to distinguish left and right TLE early in the 
course of disease are needed. 

In this manuscript, we discuss the development of an auto- 
mated CAD tool to diagnose, and lateralize, TLE using iPET. We 
also begin to address how to combine our CAD tool with manual 
analysis (MA) and incorporate it into clinical practice. Using a 
mutual information-based feature selection technique, we exam- 
ine how our methods reveal more about the distributed metabolic 
abnormalities that are associated with the different anatomical 
locations of the epileptogenic focus. 

The realistic goal of CAD tools is to complement, not to replace, 
expert analysis. Therefore, we focus on how clinical information 
and expert analysis can work synergistically with our automated 
technology. To summarize the major clinical differences, patients 
with NES are characteristically females in the third decade of 
life with psychiatric co-morbidities (Sahaya et al, 2011). PWE, 
however, also have significant psychiatric co-morbidities includ- 
ing potentially reduced financial and social independence due 
to the suspension of their driver's and, frequently, professional 
license. Particularly in adult onset epilepsy, age-associated changes 
in metabolism may confound the interpretation of iPET, possibly 
leading to an increased diagnostic uncertainty. It is well estab- 
lished that 80-90% of medication refractory epilepsy is "PET 
positive" (Salamon et al., 2008; Lee and Salamon, 2009).The rate 
of PET positivity in NES has not been studied extensively, there- 
fore the true positive predictive value of iPET is unclear. Although 
these differences in clinical presentation are salient, their quanti- 
tative effect on diagnostic probabilities is unknown. Therefore, 
we also examined how simple clinical information and expert 
manual interpretation can be incorporated into our quantitative 
CAD tool. 

The standard of care for the pre-surgical assessment for epilepsy 
is the manual correlation of iPET with numerous other diagnostic 
modalities. The goal of this assessment is to simultaneously ver- 
ify the diagnosis of epilepsy, characterize the seizure etiology, and 
identify the location and extent of the SOZ. Expert radiologists 
and neurologists can detect metabolic asymmetries indicative of 
the epileptogenic focus or foci (Person et al., 2010). The exact 
threshold at which asymmetric metabolism is attributed to patho- 
logic change or seen as a variant of normal is part of the art of 
neuroradiology (Benbadis et al., 2000; Reuber et al., 2002). Once 
non-epileptic etiologies have been ruled out, our previous work 
demonstrated that the quantitative degree of metabolic asymme- 
try is correlated with surgical outcome (Lin et al, 2007). Surgical 
outcome is improved further when iPET is co-registered to sMRI 
because of improved characterization of the focus or foci (Chan- 
dra et al., 2006; Rastogi et al, 2008; Salamon et al, 2008; Lee and 
Salamon, 2009). These hypometabolic lesions are thought to be 
secondary to increased inhibitory neuron cell death, gliosis, and 
abnormal functional connectivity resulting in altered functional 
metabolism. 

The size of the hypometabolic lesion tends to be larger than 
the SOZ, potentially due to functional changes in nearby tis- 
sue secondary to the presence of the epileptogenic lesion (juhasz 
et al, 1999; Matheja et al, 2001; Henry and Roman, 2011). Such 
reports are major limitations to the wide implementation of iPET 
in epilepsy practices (Barrington et al, 1998; So et al, 2000; Henry 
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et al., 2011). In addition to the limitation of counting statistics, 
that forces the quantitative radioactivity intensity of iPET to be 
less certain in hypometabolic lesions (Kerr and Lau, 2012), the 
biological hypothesis is that the epileptogenic abnormality induces 
metabolic abnormality at the SOZ and also at closely associated 
and/or functionally connected regions (Henry et al., 1990, 1993; 
Sperling et al, 1990; Sadzot et al, 1992; Arnold et al, 1996; Dlugos 
et al, 1999; Bouilleret et al, 2002; Rusu et al, 2005; Nelissen et al, 
2006; Takaya et al., 2006; Lee et al, 2009). The epileptogenic lesion 
commonly is larger and more diffuse in left TLE then right TLE, 
potentially because of the high degree of functional connectivity 
between specialized foci within the left temporal lobe associated 
with language and other functions (Toga and Thompson, 2003; 
Barrick et al, 2005; Iturria-Medina et al, 201 1; Haneef et al, 2012; 
Kucyi et al., 2012). These insights parallel the trend in dementia 
that atrophy starts focally then spreads more quickly to function- 
ally connected regions (Zhou et al., 2012). The limited sensitivity 
of iPET unaligned with sMRI to characterize extratemporal lesions 
may be partly due to the insufficient description of the local func- 
tional network of each extratemporal focus and thereby reduced 
detection of a characteristic pattern of metabolic abnormalities 
associated with each focus. In general, an improved insight into 
the clinical interpretation and value of metabolic abnormalities 
outside the SOZ is needed. To overcome this limitation, the iPET 
analysis is used in combination with other diagnostic modalities 
determine which tissue to resect. 

Clinical description, EEG, MRI, and FDG-PET each describe 
separate facets of the pathophysiological etiology, and therefore all 
play critical roles in the diagnosis of epilepsy, and in the identifica- 
tion of the epileptogenic lesion ( Struck et al. , 20 1 1 ) . Each modality, 
however, also has unique limitations. EEG provides an in-depth 
description of the seizures and interictal epileptiform spikes. These 
seizures and spikes, however, are rare events: only 50% of PWE 
exhibit diagnostic interictal epileptiform spikes and/or seizure 
activity during the first outpatient scalp EEG (Gilbert et al., 2003). 
The characteristic signs of epilepsy in structural and diffusion MRI 
may not be measurable until years after the first seizure because 
these methods require the detection of atrophic tissue and/or sub- 
tle regions of cortical dysplasia (Swartz et al, 1992; Reutens et al, 
1996; Van Paesschen et al., 1998; Liu et al, 2002; Jung da and 
Lee, 2010; Bernasconi et al., 2011; Schmidt and Pohlmann-Eden, 
2011; Dabbs et al, 2012). MA uses the contralateral structure to 
assess if atrophy is present but a certain degree of asymmetry is 
expected (Farid et al, 2012; Keihaninejad et al., 2012). It takes 
years of specific experience in manually analyzing sMRIs from 
PWE to reliably discriminate between normal variation and patho- 
logic changes. Once these relatively large-scale changes in neural 
structure have occurred, it is less likely that both invasive and non- 
invasive treatments will be effective (Engel et al, 2012). iPET can 
localize the epileptogenic lesion without observing rare events and, 
potentially, before changes are detectible on sMRI and/or diffusion 
tensor imaging (DTI) (Theodore et al, 1990; Ryvlin et al., 1991; 
Swartz et al, 1992; Gaillard et al, 1995; Debets et al, 1997; Knowl- 
ton et al, 1997, 2008; Blum et al, 1998; Drzezga et al, 1999; 
Benedek et al, 2004; Carne et al, 2004; Chandra et al, 2006; 
Yun et al, 2006; Uijl et al., 2007; Willmann et al, 2007; Rastogi 
et al, 2008; Salamon et al., 2008; Duncan, 2009; Lee and Salamon, 



2009; Lerner et al, 2009; Liew et al, 2009; Brodbeck et al, 2010; 
Chinchure et al., 2010; Kim et al, 2011; Jupp et al, 2012). As 
discussed above, the presence of metabolic abnormalities outside 
the SOZ, however, complicates the effective localization of the 
SOZ using iPET alone (Henry and Roman, 2011). An improved 
description of these induced changes outside the SOZ may help 
spare healthy tissue from resective surgery. Given the recent report 
that resective neurosurgery for epilepsy is more effective earlier in 
disease (Engel et al., 2012); we believe that iPET may play a crit- 
ical role in characterizing patients with unremarkable MRIs and 
inconclusive EEGs earlier in the course of their disease. 

MATERIALS AND METHODS 
PATIENT DATA 

All of the 105 patients that were included in our analysis were 
admitted to the University of California, Los Angeles (UCLA) 
Seizure Disorder Center's video-EEG Epilepsy Monitoring Unit 
(EMU) between 2005 and 2012. Each patient's diagnosis was 
based on a consensus panel review of their clinical history, phys- 
ical and neurological exam, neuropsychiatric testing, video-EEG, 
iPET, ictal FDG-PET, structural and diffusion MRI, and/or CT 
scan. This multimodal assessment is the gold standard for epilepsy 
diagnosis and localization of the epileptic focus (Cragar et al, 
2002; LaFrance and Devinsky, 2004). The patients included in 
this analysis were chosen because they had an FDG-PET after 
2005; had no history of penetrative neurotrauma, including neuro- 
surgery; were determined by consensus diagnosis to have a single, 
lateralized epileptogenic focus; and had no suspicion of mixed 
non-epileptic and epileptic seizure disorder. These patients were 
diagnosed either with LTLE (n = 39), right TLE (RTLE, n = 34), or 
NES (NES, n = 32). PET images were determined to be interictal 
by clinical findings and concurrent scalp EEG. 

PET and MRI images were acquired according to the best clin- 
ical practices at the time of acquisition. PET/CT studies were 
acquired using a Siemens Biograph scanner. After a minimum fast- 
ing period of 6h, patients received 0.14mCi/kg of 18F-FDG-PET 
intravenously. During the ensuing 40 min uptake period with con- 
comitant EEG monitoring to confirm interictal status, the patients 
waited in a quiet, dimply lit room with their eyes open. PET images 
were reconstructed with an iterative algorithm (OSEM: 2 itera- 
tions, 8 subsets). CT images were reconstructed using filtered back 
projection at 3.4 mm axial intervals to match the slice separation 
of the PET data, and used for attenuation correction. 

COMPUTER-AIDED DIAGNOSTIC TOOL TRAINING AND VALIDATION 

Automated analysis of the iPET records was performed in four 
stages. (1) First, each image was screened for gross structural 
and/or metabolic abnormalities by S.T.N., N.M.R., and/or W.T.K. 
(n = 21). These excluded subjects are not reflected in the sam- 
ple sizes quoted above. (2) NeuroQ (Syntermed, GA, USA) was 
used to segment each brain into 47 regions of interest (ROIs) and 
then to calculate the average radioactivity in each ROI, normal- 
ized by the whole brain radioactivity (Table Al in Appendix). (3) 
The minimum redundancy-maximum relevancy (mRMR) tool- 
box for MATLAB (Mathworks, MA, USA) was used to generate a 
ranked list of the ROI metabolisms (features) within each train- 
ing set that were maximally relevant to the diagnosis of epilepsy 
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and minimally redundant with all higher ranked features (Ding 
and Peng, 2005; Peng et al., 2005). The representative number of 
features to exclude and quantal levels was selected based on our 
method discussed previously (Kerr et al, 2012a,b) (see below). 
In each of the training sets, the feature ranking was determined 
exclusive of the test patient's data. We expect the ranked lists to 
be similar, but not identical, across training sets. For purely illus- 
trative purposes, the full dataset was used to create the ranked 
list in Table 2. (4) Weka was used to implement leave-one-out 
cross-validation of a cost-sensitive Multilayer Perceptron (MLP) 
that was weighted to maximize balanced accuracy, defined by the 
mean of sensitivity and specificity (Bouckaert et al, 2010). Using 
this method, we examined our ability to diagnose either LTLE or 
RTLE from NES and assessed our ability to diagnose and lateralize 
disease simultaneously. For the remainder of this manuscript, the 
latter tool that discriminates LTLE vs. RTLE vs. NES is called the 
trinary classifier. Similarly, the binary CAD tools are referred to by 
the laterality of epilepsy that is being detected. The comparison 
to NES is not stated, but can be assumed. We then compared our 
CAD tool's performance to the results of MA alone. 

MACHINE LEARNING ALGORITHMIC DETAILS 

The MLP was implemented with default parameters in Weka 
(Bouckaert et al., 2010). All input features were normalized to 
values between negative and positive 1. No limit was set on the 
number of hidden layers or nodes within each hidden layer. These 
parameters were optimized within each training set indepen- 
dently. The learning rate and momentum were set to 0.3 and 0.2, 
respectively. Five hundred epochs were used for training. Dur- 
ing training, models with more than 20 consecutive errors were 
excluded. The trinary classifier was created by decomposing the 
three class problem into three 1 -against- 1 problems that were com- 
bined using majority voting. No three-way ties occurred during 
training or testing. 

Balanced accuracy was optimized using a cost-sensitive classi- 
fier in which a false positive was given a cost of n + and a false 
negative was given the cost of n_, where n + and n_ represent the 
number of PWE and NES in the full sample, respectively. In the 
trinary classifier, the cost was set as the sum of the number of 
patients in the other two diagnostic classes. 

Cyclical leave-one-out cross-validation (CLIOCV) was used to 
assess the performance of the MLP. In this paradigm, all but one 
patient was used to determine the features selected and train the 
algorithm. The single remaining patient is tested using the model 
built upon the other patients. The identity of the test patient is 
permuted until all patients have been the test case once and only 
once. 

To determine the number and identity of the input features, the 
mRMR algorithm requires the number of input features, F, and 
quantal levels, Q, be set a priori. For the calculation of mutual infor- 
mation, the features were smoothed into Q quantal bins akin to the 
bins in a histogram. Classification, however, utilizes unsmoothed 
features. The choice of input features smoothed into quantal levels 
was determined to be most representative of the performance of 
the algorithm across a wide variety of choices of F and Q (Kerr 
et al, 2012b). This choice was made by selecting a point within a 
region of F-Q parameter space that performed significantly better 



than the naive classifier with 95% confidence based on random 
field theory correction where the spatial smoothness is estimated 
directly from the data (for more details, see Worsley et al., 1992, 
2004; Chauvin et al., 2005). The naive classifier classifies all test 
exemplars as the most common class in the training set. Under the 
CLIOCV procedure, these input features were determined inde- 
pendently for each of the training samples. The illustrated rank 
order of features was calculated based on the full dataset, and 
does not necessarily match the rank list of any individual training 
sample. 

When clinical information was incorporated into the algo- 
rithm, the same methodology was applied as above, except that all 
exemplars with missing data were excluded from analysis. In these 
additional analyses, we did not re-sample the parameter space of F 
and Q. We simply used the selections determined in the previous 
analysis. 

MANUAL ANALYSIS OF PET AND MRI RECORDS 

Manual analyses of the iPET and sMRI records were performed 
based on the review of clinical records primarily written by Dr. 
Noriko Salamon. Dr. Salamon has 10 years of experience in the 
pre-surgical assessment of epilepsy using FDG-PET and MRI. All 
manual interpretation was conducted for the clinical assessment 
of each patient when it occurred, prior to the CAD tool devel- 
opment. Therefore, Dr. Salamon was blinded to the automated 
results. Due to the unclear relationship between structural and 
metabolic abnormalities, asymmetries, and epilepsy, all abnor- 
mal results were interpreted to be consistent with some form 
of epilepsy. Not all patients had sMRI (n = 6) and iPET («= 1) 
reports available; therefore all analysis regarding MA of neu- 
roimaging includes only patients with available records. These 
patients had raw iPET data available; they therefore were included 
in the automated analysis. 

COMBINATION OF CLINICAL INFORMATION WITH COMPUTER-AIDED 
DIAGNOSTIC INFORMATION 

To examine the combined power of clinical knowledge, MA and 
our automated analysis, we assessed the linear correlation of 
detecting epilepsy with CAD compared to MA, and also incorpo- 
rated clinical information and MA into our algorithm in two ways. 
First, the clinical literature suggests that patients with NES are 
more likely to be female, begin having seizures in the third decade 
of life, have a decreased duration of disease and have increased 
seizure frequency (Table 1). Although we did not see a significant 
difference seizure frequency within our dataset, we included this 
features to better match clinical practice. These clinical features 
were then added to the input and leave-one-out cross-validation 
was repeated. Secondly, to explore how our computational meth- 
ods can complement clinical wisdom, we included the results of 
MA of the iPET and sMRI as two additional input features and 
re-evaluated CAD performance. For the trinary classifier only, we 
split each of the features describing the iPET and sMRI MA to 
indicate if a left and/or right sided abnormality was reported. 

To assess the applicability of our CAD as a separate modal- 
ity that could be considered as part of the clinical assessment 
of epilepsy, we calculated the likelihood ratios (LRs) of each of 
the combinations of our CAD with MA of iPET and/or sMRI. 
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Table 1 | Clinical information and results of manual analysis. 







NES 


LTLE 


RTLE 


Age 


MeaniSD 


37± 14* 


38± 12 


36± 13 11 




Min-Max (Median) 


16-76 (38) 


18-54 (40) 


17-67 (35) 




N 


32 


39 


34 


Sex 


% Female ±SE 


78.1 ±7.3* § 


53.8 ±8.0 


35.3 ±8.2 


riiir^tinn nf rli^p^^p 


Mpan -I- SD 

Ivluul I _1_ O \—J 


12± 12* § 


22 ± 15 


20 ± 13 




Min-Max (Median) 


10d-40y (7) 


6m-53y (21) 


2y-48y (19) 


Seizure frequency 


MeaniSD 


3.2/d±5.9/d 


1.2/d±2.4/d 


1.5/w±1.7/w 




Min-Max (Median) 


0.3/m-25/d (3/d) 


0.2/m-11/d (1/w) 


0.1/m-1/d (0.8/w) 


iPET manual 


% Positive ±SE 


18.8±6.9* § 


76.9 ±6.7 


87.9 ±5.7 




N 


32 


39 


33 


sMRI Manual 


% Positive ±SE 


34.5±8.8* § 


73.7 ±7.1 


87.5 ±5.8 




N 


29 


38 


32 



This table reflects the clinical information known before the application of the CAD tool. All times are listed in years (y) unless otherwise specified by days (d), weeks 
(w), or months (m). Manual analysis of all patients' IPET and sMRI were not done, therefore we list the number with available manual results. * or * indicate that 
the value for NES vs. LTLE, NES vs. RTLE, or LTLE vs. RTLE, respectively, is statistically significant from both the LTLE and RTLE groups with at least 95% confidence 
using a two-sample z-test of proportions or Mann-Whitney U test, where appropriate. No other differences are statistically significant (p > 0. 10). 



This was done only for the binary classifiers, because LRs have a 
clear formulation only for binary outcomes. The likelihood ratio 
is defined by the likelihood that a patient with a certain combina- 
tion of diagnostic outcomes has epilepsy, divided by the likelihood 
that the same patient has NES. Intuitively, a likelihood ratio of two 
implies that the patient is twice as likely to have epilepsy. The 
95% confidence intervals of chance were calculated using exact 
binomial intervals by considering the likelihood ratio of a clas- 
sifier that diagnosed patients according to their prior likelihood 
alone, conditioned upon the assumption that the same total num- 
ber of patients would have the diagnostic outcome of interest. 
For example, 39 of 71 patients had LTLE when we discriminated 
between LTLE and NES, therefore the median LR is 1.2. Thirty-five 
patients from the NES vs. LTLE group had negative MA of their 
iPET. Therefore, we use a binomial distribution with 35 trials and 
success probability of 39 over 71 to yield a 95% confidence interval 
of 0.94-3.38. 

RESULTS 

All of our results are compared to the gold standard diagnosis 
from the consensus panel. The clinical trial statistics of each of our 
automated diagnostic tool matched, but were not redundant with, 
expert MA of both interictal PET and sMRI (Figure 1). All inter- 
vals reflect 95% confidence intervals and all p-values correspond 
to differences from anaive classifier. The binary CAD tool for 
RTLE had accuracy of 88% (69-90%), compared to the accuracy 
of MA of iPET [85% (72-92%)] and sMRI [77% (63-85)]. The 
binary tool for LTLE had accuracy of 83% (69-90%), compared to 
the accuracy of MA of iPET [79% (66-88%)] and sMRI [70% 
(56-81%)]. The pattern in sensitivities, specificities, and odds 
ratios all parallel this trend where our automated diagnostic tools 
are non-statistically superior to MA oriPET, which, in turn, are 
non-statistically superior to MA of sMRI (Figure 1). The accuracy 



of our trinary CAD tool that simultaneously diagnoses epilepsy 
and lateralize disease was 76% (66-84%), where 89% (77-96%) 
of patients correctly identified with epilepsy were also lateralized 
correctly. MA to diagnose and lateralize was 78% (69-86%) accu- 
rate with 89% (76-94%) correctly lateralized using iPET and 71% 
(61-80%) accurate with 91% (78-97%) correctly lateralized using 
sMRI. 

The rank order of the features used in our algorithm parallel 
the clinical observation that the epileptogenic networks in LTLE 
are broader than in RTLE. The LTLE vs. NES classifier achieved its 
performance by utilizing trends across almost the entire brain by 
including 42 of the 47 features in the final algorithm. In contrast, 
the RTLE vs. NES classifier only needed to measure the metabo- 
lism in six regions - bilateral temporal cortex and two associated 
regions of cortex- to achieve its impressive performance (Table 2). 
As expected, the trinary classifier utilized an intermediate number 
of features to achieve its accuracy (30 of 47). The rank list of 
these features matches the biological intuition based on knowl- 
edge about the potential connectivity of epileptogenic networks 
(Table 2). 

We then considered how this CAD information could be used 
in combination with clinical information or expert analysis. The 
squared correlation of our CAD tool with manually interpreted 
iPET was 0.25 (0.09-0.43), 0.32 (0.17-0.54), and 0.34 (0.17-0.46) 
for the LTLE, RTLE, and trinary classifiers, respectively (Figure 2). 
The squared correlation of our tool with manually interpreted 
sMRIwas0.07 (0.001-0.23), 0.21 (0.06-0.40), and0.ll (0.02-0.25) 
for the LTLE, RTLE, and trinary classifiers respectively. For com- 
parison, the squared correlation between manually interpreted 
iPET and sMRI was 0.17 (0.06-0.33). 

When the same automated analysis was used to combine 
clinical findings with our iPET data, performance did not change 
significantly. After the four clinical factors were added to the input 
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]CAD iPETdlMA iPETCTJMA sMRI □Gender 



Accuracy Sensitivity Specificity 
RTLE vs NES 



Accuracy Sensitivity Specificity 
LTLE vs RTLE vs NES 




Accuracy Sensitivity L Sensitivity R Specificity 

FIGURE 1 | CAD tool performance matches manual analysis. These 
figures indicate the accuracy, sensitivity and specificity of the LTLE (A), 
RTLE (B) and trinary (C) classifiers. The performance of our CAD tools 
matched that of MA and was superior to just using gender alone. The error 
bars indicate standard error of the mean performance for each measure. 
The translucent region indicates the performance of a na'ive classifier. 
"Indicates significant differences from the naive classifier with a confidence 
level of 95% or more. 



of our tools, the accuracy changed to 79% (66-88%), 68% (56- 
79%), and 64% (54-73%) for the LTLE, RTLE, and trinary classi- 
fiers, respectively (Figure 3). These accuracies do not substantively 



Table 2 | Ranked list of contributing metabolic ROIs. 



Region of interest 



mRMR rank LTLE vs. NES 



RTLE vs. NES 



Trinary 



Midbrain 

L ilp temporal C 

R ilp temporal C 

L associative visual C 

L Broca's Region 

L s frontal C 



R ila temporal C 
R ilp temporal C 
L sensorimotor C 
L si temporal C 
R thalamus 
R i frontal C 



R ila temporal C 
L ilp temporal C 
L sensorimotor C 
R ilp temproal C 
R si temporal C 
R pm temporal C 



This table illustrates the top six informative and non-redundant regions of inter- 
est (ROIs! that may contribute to each of the CAD tools, as determined by the 
minimum redundancy-maximum relevancy criteria (mRMR; Ding and Peng, 2005; 
Peng et al., 2005). The illustrated rank order of features was calculated based on 
the full dataset and does not necessarily match the rank list of any individual 
training sample. The leading L or R indicates left or right. The lowercase letters 
indicate inferior (i), lateral (I), median (m), anterior (a), and posterior (p). The lagging 
C signifies cortex. Note that the LTLE vs. NES and trinary classifiers include infor- 
mation from 42 and 30 ROIs, respectively. To better understand the benefit of 
mRMR, this list can be directly compared to the list of ROIs ranked by t-statistics 
in Table Al in Appendix. 




jTrinary 
□ LTLE 
■ RTLE 

I iCross Modality * 



lit 111 _ 



iPET 



sMRI 



FIGURE 2 | CAD tool is not redundant with manual analysis. The 

squared correlation of our CAD tools' results with those of MA of the iPET 
or sMRI from the same patients was below 50%. This indicates that while 
some information is shared, the majority of information provided by our 
CAD tools is not captured by MA. The correlation between MA of iPET and 
sMRI is similar in magnitude to the correlation of CAD with MA, therefore 
the CAD could potentially be seen as similar to another informative 
modality. "Indicates significant differences of the correlation from zero with 
a confidence level of 95% or more. 



change when only sex and duration of disease were considered 
(results not shown). Adding the results of MA of both iPET and 
sMRI to our iPET data changed the accuracy to 82% (73-91%), 
77% (67-88%), and 68% (59-77%) for the LTLE, RTLE, and 
trinary classifiers, respectively. When all information sources con- 
tribute to the algorithm, the accuracy changed to 77% (68- 
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FIGURE 3 | Automated combination of clinical information with 
automated analysis of iPET images. The automated combination of 
clinical information and/or MA with our analysis produced no significant 
change in performance for the LTLE (A), RTLE (B) or trinary (C) classifiers, 
relative to the CAD operating on automated values alone. The unshaded 
bars indicate the performance of similarly constructed CAD tools using 
clinical information or the results of MA alone. The shaded bars indicate the 
modified performance when information from NeuroQ is added. The 
horizontal line indicates the mean accuracy of each CAD tool without 
clinical information. The translucent region indicates the performance of a 
na'ive classifier. 



74% (64-85%), and 76% (68-84%) for the LTLE, RTLE, and 
trinary classifiers, respectively. 

We combined the results of MA were combined with our CAD 
tool manually using LRs. After doing so, the likelihood was gen- 
erally only significant if all considered modalities agreed. Viewed 
alone, MA and our CAD increased the likelihood of the predicted 
outcome between two and ninefold (p < 0.02; Figure 4A). When 
two analysis streams were combined, if both analyses agreed, the 
likelihood of the predicted outcome was increased between 8- and 
27-fold (p < 3 x 10~ 4 ; Figures 4B,C). If all three analyses agreed, 
the likelihood of the predicted outcome increased more than 15- 
fold (p < 1.3 x 10~ 5 ; Figure 4D). However, in most cases, if there 
was any disagreement, the likelihood did not change significantly, 
most probably due to the small numbers of patients with each 
potential outcome. There are two key exceptions: ( 1 ) Given iPET 
results indicating NES over RTLE using either MA or CAD, the 
sMRI could be largely ignored (p < 1.1 x 10~ 2 ). (2) If both MA 
and CAD of iPET agreed that a patient suffered from LTLE and not 
NES, the sMRI results could be similarly ignored (p < 3.3 x 10~ 2 ). 

DISCUSSION 

These results demonstrate how our CAD tool has the potential 
for clinically application, while also confirming and elucidating 
the distributed effects of epilepsy on the entire brain. Our CAD 
tool's diagnostic performance of TLE matches, but is not redun- 
dant with, expert MA of iPET and sMRI. When considered in 
the context of recent reports of CAD tools for epilepsy based on 
sMRI and interictal EEC data (Farid et al, 2012; Focke et al., 2012; 
Keihaninejad et al., 2012; Kerr et al., 2012a), CAD is proving espe- 
cially applicable to epilepsy. Further, if more work confirms the 
hypothesis that metabolic changes in iPET are observable before 
the structural changes in sMRI, our iPET tool may have better 
clinical utility than these existing sMRI tools. In contrast to MA, 
this and other CAD tools can be quickly and efficiently applied 
by minimally trained technicians, emergency physicians, and pri- 
mary care providers as preliminary analysis of the iPET images 
(van Ginneken et al., 2011; Kerr et al., 2012c). The performance 
of MA can vary with experience and fatigue of the observer; auto- 
mated tools are consistent over time. Upon further validation, 
these CAD results could also be incorporated into the consensus 
diagnoses with minimal cost if iPET already has been obtained. 

CLINICAL IMPACT 

Our CAD tools could provide valuable clinical information that 
may help readily identify which treatments may be effective 
in patients who present with uncharacterized, and/or medica- 
tion refractory seizures (Kerr et al., 2012a,c). In particular, 15 
of our 105 patients were admitted twice to achieve definitive 
characterization or localization of their seizures. The appropriate 
binary classifier correctly diagnosed 12 (80%) of these challeng- 
ing patients. This valuable information might reduce the need 
for multiple video-EEG admissions. Additionally, 28% (9/32) 
of our PWN were admitted for improved characterization of 
their previously-diagnosed "epilepsy," and 16% (12/73) of our 
PWE were admitted for the differential diagnosis of epilepsy, 
indicating that non-epileptic etiologies were not ruled out suf- 
ficiently. The trinary CAD effectively diagnosed 67% ( 14) of these 
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FIGURE 4 | Combination of clinical information and CAD results using 
likelihoods. Columns in this log plot above 1 indicate that the seizures are 
more likely to be epileptic whereas the columns below 1 indicate a 
non-epileptic etiology is more probable. (A) Illustrates the positive and 
negative likelihood ratio of each analysis method considered individually. 
(B,C) Illustrate the likelihood ratios of each possible outcome when two 
analysis methods are combined. (D) Indicates the likelihood ratios of each 
possible outcome when all analysis methods are combined. If all 
modalities agree, the likelihood non-significantly increases with the 



particularly challenging patients. Despite this impressive perfor- 
mance, the ultimate goal of CAD, however, is to complement - not 
replace - MA. 

COMBINATION OF AUTOMATED ANALYSIS WITH CLINICAL WISDOM 

Our finding that performance almost uniformly, but non- 
statistically, decreased when the automated algorithm incorpo- 
rated clinical information indicates that automated analysis can- 
not and should not replace manual interpretation across infor- 
mation modalities. We suspect that this performance decreased 
due to ineffective modeling of the contribution of the clinical 
information and over-fitting. The statistical distribution of the 
clinical factors was very different from the metabolic data therefore 
the same model likely cannot effectively utilize both modalities. 
The efficient incorporation of multimodality information into 
machine learning is an active area of theoretical research, and well- 
validated methods are not yet available. Now that CAD tools using 
interictal EEG (Kerr et al, 2012a), sMRI (Farid et al, 2012; Focke 
et al., 2012; Keihaninejad et al., 2012), and iPET have been pub- 
lished, we believe it will be extremely exciting to assess how these 
various tools can be combined. 

We expected that the best performance would be achieved when 
our CAD is used synergistically with MA. The low correlations 



addition of each modality. However, if there is disagreement, the likelihood 
ratio is generally not significantly different from chance. The translucent 
bars indicate the 95% confidence interval for chance with the relevant sign 
(see Materials and Methods). The numbers above the translucent bars 
indicate the total number of patients with each outcome. The bars that go 
off the scale of the graph diverge toward zero or infinity because no 
patients of a certain class had that outcome. "Indicates significant 
differences of the correlation from zero with a confidence level of 95% 
or more. 



between the CAD results and MA suggest that our CAD tool pro- 
vides information that is not evident on visual inspection. These 
results emphasize that PET is not redundant with MRI (Henry 
et al., 1999). Physicians could learn to view CAD as analogous 
to another imaging modality that provides valuable, but not per- 
fectly diagnostic, clinical insight. This synergistic application of 
computer-aided diagnosis after manual interpretation already has 
proven beneficial in the detection of lung nodules by the FDA 
and is an active area of translational research (Kerr et al., 2012c; 
Wang et al., 2012). The key differences between MA and auto- 
mated analysis are the ability to entirely ignore certain pieces of 
data, and to rule that the results are inconclusive. 

The results summarized above, and the LRs for each analy- 
sis stream individually, show that both MA and CAD are useful 
clinically. If the analysis streams agree, the diagnostic certainty 
increases substantially, but at a cost: as more analyses are added, 
more patients have inconclusive results because the analyses did 
not agree, and the LRs are not significant. Even though our sample 
size is large compared to other studies of this type, there were not 
enough patients in our dataset with each diagnostic outcome to 
explain the clinical implication of disagreeing analyses adequately. 
This matter of inconclusive results is a common challenge faced in 
clinical practice. Physicians struggle regularly with those types of 
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decisions. When MA of iPET and sMRI are combined, they need to 
agree to yield meaningful results. However, our analysis shows that 
in some specific cases, if both the MA and CAD of iPET agree, the 
sMRI is not needed. This parallels the finding we suggested above: 
iPET may be more clinically useful than sMRI to diagnose and lat- 
eralized epilepsy. The hypometabolic abnormality may be present 
earlier in disease (Theodore et al., 1990; Ryvlin et al, 1991; Swartz 
et al, 1992; Gaillard et al, 1995; Debets et al, 1997; Knowlton 
et al, 1997, 2008; Blum et al, 1998; Drzezga et al, 1999; Benedek 
et al, 2004; Carne et al, 2004; Chandra et al, 2006; Yun et al, 2006; 
Uijl et al, 2007; Willmann et al, 2007; Rastogi et al, 2008; Sala- 
mon et al, 2008; Duncan, 2009; Lee and Salamon, 2009; Lerner 
et al, 2009; Liew et al, 2009; Brodbeck et al, 2010; Chinchure et al, 
2010; Kim et al., 201 1; Jupp et al., 2012), and it may provide slightly 
more accurate disease characterization, as seen in our dataset. In 
settings where the PET scanner is not combined with the MRI 
scanner, and/or when the cost of imaging is a limiting factor (both 
common occurrences) the effective application of our CAD could 
result in substantial cost savings. 

PATHOPHYSIOLOGICAL INSIGHTS 

Our methods also reveal a potential difference in the pathophysi- 
ology of left vs. right TLE. This may help explain why CAD tools 
perform slightly better when diagnosing RTLE compared to LTLE 
(Farid et al, 2012; Focke et al, 2012; Keihaninejad et al, 2012). 
The finding that mostly bilateral temporal ROIs, the right inferior 
frontal cortex and left sensorimotor cortex provide non-redundant 
diagnostic information for RTLE is consistent with the clinical wis- 
dom that the epileptogenic network in RTLE is more focal than 
in LTLE. The inclusion of temporal regions echoes the conven- 
tional wisdom that focal hypometabolism and asymmetry reflect 
characteristic changes due to epilepsy. This suggests that conserva- 
tive resection of the temporal lobe may result in increased rates of 
seizure freedom in RTLE compared to LTLE due to complete resec- 
tion of the SOZ. Further, seizures that originate in the left temporal 
lobe may secondarily generalize more frequently in LTLE. These 
differences have not yet been studied clinically. 

The trends in the extratemporal regions included in the algo- 
rithms suggest that the primary lesion may induce metabolic 
changes in functionally or anatomically associated regions. This 
is substantiated further by the finding that almost all regions of 
the brain provide informative diagnostic information in LTLE. 
This in turn mirrors the increased stereotypic connectivity of the 
left temporal lobe. Even though the interconnectivity of the right 
hemisphere is higher than the left hemisphere, the left hemisphere 
has strong connections between specialized foci (Barrick et al, 
2005; Iturria-Medina et al., 2011; Kucyi et al, 2012). We hypoth- 
esize that the SOZ may induce abnormal metabolism along these 
strong, stereotyped connections. This change cannot be attrib- 
uted to language specifically in our dataset because we did not 
identify the laterality of language dominance in our patients. 
Compared to our f-statistics ranking, it may seem surprising that 
the metabolism of the midbrain was ranked first by mRMR for 
LTLE vs. NES. This rank may indicate a non-linear change in 
the metabolism within the dorsal midbrain anticonvulsant zone, 
which has itself been identified in animals to be part of the net- 
work that modulates seizure threshold (Shehab et al., 1995). The 



exact relationship between epilepsy and midbrain metabolism is 
unclear, however. The lack of distributed atrophy in LTLE mea- 
sured by sMRI suggests that these changes are not associated with 
distributed cell death or gliosis (Farid et al., 2012; Focke et al, 
2012; Keihaninejad et al, 2012). Instead, we hypothesize that this 
change instead reflects abnormal metabolism in these regions due 
to altered neural connectivity and/or activity secondary to the 
epileptogenic lesion. This is supported by the finding that LTLE 
was associated with more changes in functional connectivity than 
RTLE was (Pereira et al, 2010). This also explains why we observed 
metabolic changes in the right thalamus in RTLE: recent work 
demonstrates that the connectivity of the right thalamus with the 
right hippocampus is reduced in RTLE (Morgan et al., 2012). The 
presence of such distributed changes also supports the finding 
that the size of the hypometabolic lesion visualized on PET may 
be larger than the SOZ (Juhasz et al, 1999; Matheja et al, 2001; 
Henry and Roman, 201 1). It is particularly interesting to note that 
the extent of these distributed changes is underappreciated by f- 
statistics comparing LTLE to NES. This indicates that there is a 
complex, likely non-linear, relationship between the metabolism 
of the hypometabolic lesion and its associated tissue that may be 
better understood by mutual information. 

The inclusion of the contralateral hippocampus in both of the 
binary classifiers lends itself to multiple interpretations that are 
all supported by biologically sound hypotheses. Firstly, a salient 
feature of LTLE or RLTE could be asymmetric metabolism, as 
suggested clinically; therefore the metabolism of the contralat- 
eral hippocampus was compared to the observed metabolism in 
the ipsilateral hippocampus. Alternatively, the interhemispheric 
connectivity between the hippocampi is high, therefore under 
our hypothesis that changes in metabolism spread according to 
functional connections, the metabolism in the contralateral hip- 
pocampus may be one of the first induced changes due to the 
epileptic lesion. Lastly, if LTLE and RTLE are inherently bilateral 
diseases then the metabolism in the contralateral hippocampus 
may also be abnormal. This also provides an explanation for why 
LTLE and RTLE were not perfectly distinguished. 

In addition to diagnosing epilepsy, our algorithm lateralized 
disease efficiently with an accuracy of approximately 90% when 
epilepsy was diagnosed correctly. This impressive accuracy could 
be clinically useful for pre-surgical planning, when used in combi- 
nation with other clinical and radiological information. Although 
our current sample size is too small to fully assess this potential 
fully, our results suggest that similar methodology could be applied 
to a larger dataset with more diverse and specific SOZ localiza- 
tions to yield an objective and reliable tool to assist in pre-surgical 
SOZ localization. Our data suggest that this approach likely would 
identify and utilize distributed metabolic findings associated with 
each epileptic lesion to improve performance. Instead of blur- 
ring the boundary of the SOZ by detecting affected tissue outside 
the SOZ, the improved understanding of these distributed effects 
may lead to more refined characterization of this clinically vital 
SOZ. However, the spatial resolutions of our outcome classes were 
insufficient to assess the utility of this method directly to identify 
candidate lesions for resective surgery. 

While our lateralization accuracy is exciting, there is also 
a potential clinical interpretation of the patients who were 
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falsely-lateralized. Functional connectivity between the temporal 
lobes is particularly strong. In a minority of patients, this con- 
nectivity allows epileptogenic activity to spread quickly from the 
SOZ to the contralateral temporal lobe on EEG, resulting in the 
appearance of either bilateral or falsely-lateralized disease. Simi- 
larly to the distributed networks discussed above, this high degree 
of functional connectivity also may induce metabolic abnormali- 
ties in the contralateral temporal lobe that may be indistinguish- 
able from the primary lesion. This hypothesis can be tested by 
comparing these falsely-lateralized patients to patients with bilat- 
eral TLE. This comparison requires a detailed methodological 
treatment of non-mutually exclusive classes in machine learning 
and therefore lies outside the scope of the current manuscript. 

To characterize these and other pathophysiological insights, 
most studies utilize healthy neurologically normal controls. In 
contrast, we prefer the use of PWN as our control group. In brief, 
when constructing a control group, one aims to match the patients 
in the pathologic group in all aspects other than the pathology. 
In contrast to neurologically normal controls when compared to 
PWE, PWN's have been exposed similarly to AEDs and other med- 
ications, have increased prevalence of TBI and some other risk 
factors for epilepsy (Sahaya et al, 201 1), have regular and frequent 
meetings with health care providers, and have much more strict 
inclusion criteria. Lastly, and perhaps most importantly, physicians 
do not consider whether all of their patients have epilepsy; they 
assess only the patients with seizures. Therefore, in our opinion, 
the use of PWN as the control group is a benefit in of our study 
because it maximizes the clinical relevance of our results while 
simultaneously improving its statistical selectivity. 

LIMITATIONS AND FUTURE DIRECTIONS 

Because our retrospective dataset was collected as part of clinical 
care, our approach has a few important limitations. The accuracy 
of MA reported in our patients is worse than the rates quoted 
in previous literature (Rastogi et al., 2008; Salamon et al., 2008; 
Lee and Salamon, 2009). Given UCLA's status as a tertiary refer- 
ral center, the decrease in manual accuracy likely indicates that 
our patients had more heterogeneous etiologies and/or were more 
complex and difficult to diagnose than other centers. This sug- 
gests that our CAD tool may perform better on other datasets. Our 
iPETs and MRIs were collected on varying cameras with varying 
resolutions. This demonstrates the flexibility of our automated 
analysis using NeuroQ. The efficacy of the MA of older and lim- 
ited resolution data may not be comparable to that of more current 
and higher resolution data. After establishing the efficacy of our 
method, we plan to both validate our tool prospectively on data 
from other centers, and to incorporate multi-center data into our 
algorithm to further improve its performance. Additionally, we 
only discuss the combination of CAD results with independently 
derived MA. Future work will examine the efficacy of CAD tools 
informed by MA and vice versa. 

Critics of our approach might claim that the significant gender 
and age difference of the patients with NES compared to PWE 
may lead to our CAD simply detecting the age and/or gender of 
the patients. While we do not expect this to be the case for RTLE, 
the utilization of language areas by the LTLE classifier might reflect 
differences in gender, and not epileptogenic pathology. However, 



the performance of our CAD was significantly higher than when 
clinical information was used directly, therefore the algorithm 
utilized more information than just clinical data to achieve its 
strong performance. These significant differences in clinical fac- 
tors largely mirror the observed differences in clinic; therefore our 
dataset better matches the population for which our CAD tool 
would be applied. The only notable exception is the significant age 
difference between LTLE and RTLE, which was unexpected. Due 
to the naturalistic nature of our data collection scheme, we did not 
correct for this difference. However, we note similarly to the NES 
group, the use of age alone was significantly worse than our tool 
and the addition of age to the iPET data to control for its effect 
did not significantly change performance. 

Another key caveat to the direct clinical application of our tool 
to clinical practice is the fact that epilepsy is an extremely het- 
erogeneous disease. The generalization of our method to bilateral 
TLE, extratemporal foci and multifocal epilepsy will be critical 
before it can be incorporated into clinical practice. In particular, 
even though NES mimic all types of seizures, it is uncommon 
for TLE to be mistaken for NES. Instead, it is more common 
that NES appear to have a focus in frontal cortex (LaFrance and 
Benbadis, 2011). Therefore, the literature suggests that the high- 
est impact CAD tool would discriminate between frontal lobe 
epilepsy and NES and another, separate tool could be used to 
lateralize TLE. Based on our results above (see section Clinical 
Impact), we believe that our TLE-specific tool may be clinically 
applicable. For the first publication demonstrating the applica- 
bility of computer-aided diagnosis based on iPET data, we chose 
to focus on the diagnosis and lateralization of TLE, based, based 
on prior findings that the sensitivity of iPET is highest for TLE. 
Our future work then can address generalizing our methods 
to the other epilepsies, including bilateral TLE and frontal lobe 
epilepsy. 

CONCLUSION 

Despite a few caveats, and upon further validation with data from 
other centers, our automated methods could provide unique infor- 
mation for the effective and efficient characterization of epilepsy, 
with the potential to decrease the fraction of patients with NES 
that are being treated (inappropriately) with AEDs, and to more 
quickly triage patients with medication refractory epilepsy toward 
surgical intervention. This may help achieve the ultimate goal: a 
global reduction in seizures (Engel et al, 2012). 
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APPENDIX 

Table A1 | This table illustrates the ranking of informative regions of 
interest (ROIs) based on the maximum magnitude of f -statistic 
across the three contrasts. 
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1.450 


-2.343* 


2.995** 


Lam temp C 


-1.349 


1.415 


-2.981** 


Ram temp C 


2.176* 


-1.234 


2.986** 


Ri parietal C 


2.975** 


1.266 


2.072* 


R parietotemporal C 


2.246* 


-0.325 


2.720** 


Ls frontal C 


-1.828 


0.545 


-2.389* 


Rilp temp C 


0.447 


-2.321* 


2.361* 


La cingulate C 


1.274 


2.320* 


-1.021 


L thalamus 


-1.663 


-0.192 


-2.236* 


Lpm temp C 


-1.212 


1.069 


-2.189* 


Ri frontal C 


-0.742 


-2.172* 


1.409 


Lsl temp C 


-1.664 


1.000 


-2.163* 


L lentiform nucleus 


-1.115 


1.399 


-2.032* 


Lm frontal C 


-1.949 


0.244 


-2.001* 


Li frontal C 


-1.930 


0.059 


-1.971 


Ls frontal C 


1.536 


-0.351 


1.943 


Rm frontal C 


1.482 


-0.280 


1.807 


Rm frontal C 


0.878 


-0.883 


1.726 


Rp cingulate C 


0.034 


-1.215 


1.712 


R sensorimotor C 


1.629 


0.989 


0.750 


Li parietal C 


1.576 


1.361 


0.193 



(Continued) 



Region of interest LTLE vs. NES RTLE vs. NES LTLE vs. RTLE 



R Broca's region 


1.108 


-0.442 


1.570 


R caudate nucleus 


1.462 


0.925 


0.559 


L caudate nucleus 


-0.937 


0.599 


-1.460 


L Broca's region 


-1.028 


0.668 


-1.427 


R primary visual C 


-0.237 


-1.362 


1.393 


Lm frontal C 


1.096 


1.370 


-0.291 


Ra cingulate cortex 


1.286 


0.407 


0.889 


R thalamus 


0.266 


-0.944 


1.282 


R lentiform nucleus 


0.947 


0.442 


0.271 


L primary visual C 


-0.944 


-0.149 


-0.810 


L assoc visual C 


-0.142 


-0.861 


0.660 


Vermis 


-0.197 


0.507 


-0.701 


Pons 


0.667 


0.473 


0.153 


L parietotemporal C 


-0.569 


-0.530 


-0.028 


R cerebellum 


-0.087 


0.406 


-0.535 


Lp cingulate C 


-0.211 


-0.498 


0.393 


L cerebellum 


-0.461 


-0.321 


-0.054 


Midbrain 


-0.383 


0.100 


-0.435 


Ls parietal cortex 


0.093 


0.195 


-0.096 



Negative t-values in the first two columns indicate the hypometabolism in 
epilepsy. Negative t-values in the last column indicate hypometabolism in LTLE 
compared to RTLE. The leading L or R indicates left or right. The lowercase let- 
ters indicate inferior (i), lateral (I), median (m), anterior (a), and posterior (p). The 
lagging C signifies cortex. Bold indicates significant differences at greater than 
the 95% confidence level. The markings of *, **, and *** indicate significance 
at the 95, 99, and 99.9% confidence level, respectively, without multiple testing 
correction. No t-statistics remain significant at the 95% confidence level after 
Bonferroni correction. 
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